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Abstract: The diatom Phaeodactylum tricornutum can accumulate eicosapentaenoic acid 
(EPA) up to 30% of the total fatty acids. This species has been targeted for isolating gene 
encoding desaturases and elongases for long-chain polyunsaturated fatty acid (LC-PUFA) 
metabolic engineering. Here we first report the cloning and characterization of A5-elongase 
gene in P. tricornutum. A full-length cDNA sequence, designated PhtELOS, was shown to 
contain a 1110 bp open reading frame encoding a 369 amino acid polypeptide. The 
putative protein contains seven transmembrane regions and two elongase characteristic 
motifs of FLHXYHH and MYSYY, the latter being typical for microalgal A5-elongases. 
Phylogenetic analysis indicated that PhtELOS belongs to the EL05 group, tightly clustered 
with the counterpart of Thalassiosira pseudonana. Heterologous expression of PhtELOS in 
Pichia pastoris confirmed that it encodes a specific A5-elongase capable of elongating 
arachidonic acid and eicosapentaenoic acid. Co-expression of PhtELOS and IsFAD4 
(a A4-desaturase irom Isochrysis sphaerica) demonstrated that the high-efficiency 
biosynthetic pathway of docosahexaenoic acid was assembled in the transgenic yeast. 
Substrate competition revealed that PhtELOS exhibited higher activity towards n-3 PUFA 
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than n-6 PUFA. It is hypothesized that Phaeodactylum EL05 may preferentially participate 
in biosynthesis of transgenic LC-PUFA via a «-3 pathway in the yeast host. 

Keywords: diatom fatty acids metabolites; Phaeodactylum A5-elongase; functional 
stacking of EL05 and FAD4 



Abbreviations 

ARA, arachidonic acid; ALA, a-linolenic acid; DHA, docosahexaenoic acid; DTA, 
Docosatetraenoic acid; DP A, docosapentaenoic acid; EPA, eicosapentaenoic acid; FA, fatty acid; 
FAMEs, Fatty acid methyl esters; FID, flame ionization detector; GC-MS, Gas 
Chromatography-Mass Spectrometry; LA, linoleic Acid; LC, long chain; ORF, open reading frame; 
PUFA, polyunsaturated fatty acid. 

1. Introduction 

Polyunsaturated fatty acids (PUFAs) are fatty acids of 18 carbons or more in length with two or 
more double bonds. They can be classified into n-6 and n-3 major groups, depending on the position of 
the first double bond proximate to the methyl end of the fatty acid. PUFAs are important structural 
components that confer membrane fluidity and selective permeability [1-3]. Deficiencies in long chain 
(LC) PUFAs such as DHA and ARA have been associated with disorders in cognitive function, early 
infancy and various other physiological processes. The n-3 PUFAs including DHA and EPA can be 
used as preventive drugs and supplements [3-6]. They can be obtained either through diet of marine 
fish or synthesized from dietary essential fatty acids. However, the marine fish industry is increasingly 
declining due to overfishing and environmental pollution. It is imperative to develop sustainable and 
alternative sources of LC-PUFAs [7,8]. One strategy is to make use of high-efficiency desaturase and 
elongase genes for transgenic production of LC-PUFAs [7]. 

The fatty acid desaturases and elongases play key roles in the processes of the conventional 
A6-pathway, alternative A8-pathway and microbial A4-pathaway [8,9]. The co-3 (or n-3) LC-PUFA 
biosynthesis can start with ALA as substrate which can be produced by some yeasts and plants. The 
biosjaithesis of DHA fi-om ALA involves a series of desaturation and elongation reactions catalyzed by 
various desaturases and elongases. A5-elongase and A4-desaturase are rate-limiting enzymes for DHA 
biosjoithesis [2,9]. Molecular characterizations of desaturase and elongase in transgenic organisms 
were explored with species of yeast, microalgae and plants [9-11]. Not a few desaturase genes have 
been characterized from species of mosses, algae, fungi and others [12-18]. There are few reports 
about the A5-elongases in Pavlova salina, Marchantia polymorpha and Thraustochytrium sp. [19-23]. 
Nevertheless, so far the A5-elongase of Phaeodactylum tricornutum has not been characterized. This 
diatom is rich in n-3 PUFAs with superior genetic resources revealed by genomic research [24,25]. We 
aimed to characterize the diatom elongase and desaturase which can be used for LC-PUFA metabolic 
engineering. In this study, we describe the isolation and characterization of A5 -elongase gene from 
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Phaeodactylum, and the functional stacking of A5-elongase and A4-desaturase in transgenic yeast 
realizing efficient formation of DHA. 

2. Results 

2.1. Isolation of Gene Encoding A 5 -elongase from P. tricomutum 

NCBI search identified a sequence of P. tricomutum mRNA, XM_002 176650.1 that predicted a 
hypothetical protein XP_002 176686.1. NCBI conserved domain database (CDD) analysis indicated 
that this shortened protein belonged to the ELO-super family, sharing 53% identity with the closest 
homologue in T. pseudonana. In order to isolate the full length cDNA and the putative gene, primers 
(Table 1) were designed for amplification of target gene and coding sequence using the High Fidelity 
PCR system (Roche). The PCR fragments were fully sequenced and assembled. The contigl of 
1241 bp was identified from cDNA as a coding sequence containing an open reading frame (ORF) of 
1110 bp in length; and the contig2 from genomic DNA is 1661 bp long containing a partial 5'-UTR 
(415 bp) and a putative structural gene (1212 bp) which includes a 102 bp long intron. Sequencing 
analysis indicated that two sequences of ORF of 1 1 10 bp were identical and designated PhtEL05. 



Table 1. Microbial sfrains, plasmids and primers. 



Strain, Plasmid or Primer 


Characteristic, Use and Source 


Strains 




E.coli Top 10 


E.coli host; for DNA manipulations, Transgene (Beijing, China) 


P.pastorisGSWS 


his4; Invitrogen (Invitrogen China Limited, Beijing, China) 


PHCOl 


GS115 transformed by empty pHBM906 vector, as control 


PHE5.01 


GS115 carrying pHBM-PtEL05, for gene expression 


PHE5d.01 


GS115 carrying pHBM-PtEL05-Al, for gene expression 


PHE5d.02 


GSl 15 carrying pHBM-PtEL05-A2, for gene expression 


PACOl 


GS 1 1 5 carrying pA08 1 5, as control 


PAE5.01 


GS 1 1 5 carrying PtELOS cassette in pAOS 1 5 , for expression 


PDEOl 


GSl 15 carrying FAD4-EL05 cascade in pA0815, for co-expression 


Plasmids 




pMD18-T 


T-cloning vector, Ap"^, Takara; for gene cloning 


pHBM906 


Ap', transformation vector for P. pastoris; stored in our lab 


pHBM-EL05 


Ap', PCR fragment containing PtELOS coding sequence, generated with primers 




Ptelo5-U/ Ptelo5-D, cloned into pHBM906 


pA0815 


Ap^, H1S4, P. pastoris expression vector with AOXl promoter and terminator 


pA0-FAD4 


Ap"^, PCR fragment containing IsFAD4 coding sequence, generated with primers 




1SFAD4E-F/ 1SFAD4E-R, cloned into pA0815 


pA0-EL05 


Ap"^, PCR fragment containing PtELOS coding sequence, generated with primers 




PTEL05E-F/PTEL05E-R, cloned into pA0815 


pT-EL05 


Ap', PCR fragment containing PtELOS coding sequence, generated with primers 




EL05BGL-F/EL05BGL-R, cloned into pMD18-T 


pAO-D4E5 


Ap"^, BgDl digested fragment oiPtELOSORF subcloned into 




fiflwM-digested and dephosphorylated pAO-FAD4 
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Table 1. Cont. 



* Primers 




Ptelo5-Ul 


5'- GGGAGACCAGATGGTCGACG-3' 


Ptelo5-U2 


5'- TCGCGATACCCCGAATATAT-3' 


Ptelo5-U3 


5'- CAGTTGTCCCTTCAGAACAGC-3' 


Ptelo5-U4 


5 - TCGTGTAGAAGAGCGTGGCG-3 


Ptelo5-Dl 


5'- GCTCTGTAATATAGTGCTCTG-3' 


Ptelo5-U 


5'- GTCATGTGTGGTCCCACAGATACAG-3' 


Ptelo5-D 


5'- GGCCACTACGARAGACCGGTCATC-3' 


Ptelo5-dellF 


5'- GTCGATCCACCCGTGCCCTCTCT-3' 


Ptelo5-del2F 


5'- GTCTTGCACAACTGGAAGGTTC-3' 


ISFAD4E-F 


5 - CCGCCGGAATTCGCC ATGTGC AACGCGGCAGTCG-3 


ISFAD4E-R 


5 '- CCGCCGGAATTCTC AATCCGCCTTGAGCGTCTC-3 ' 


PTEL05E-F 


5'- CCGCCGGAATTCGCCATGTGTGGTCCCACAGATAC-3' 


PTEL05E-R 


5 '- CCGCCGGAATTCCTACGAAGACCGGTCATCCC-3 ' 


EL05BGL-F 


5'- GGAAGATCTAACATCCAAAGACGAAAGG-3' 


EL05BGL-R 


5'- GGAAGATCTGCACAAACGAACGTCTCAC-3' 


Co-F 


5'- GCTCATGATCAACGGGCTCTACCA-3' 


Co-R 


5'- TCCCCACACTGCGAAGACACCTAC-3' 


5'AOXl 


5'- GACTGGTTCCAATTGACAAGC-3' 


3'AOXl 


5'- GCAAATGGCATTCTGACATCC-3' 



* Note: Primers Ptelo5-Ul to U4 and -Dl were used for PGR amplifications against putative PhtELOS structural gene 
and cDNA; Ptelo5-U, -dellF, -deI2F, and -D for PCR-cIoning of the full length or truncated PhtELOS into pHBM906; 
ISFAD4E-F and ISFATWE-R for making vector pAO-IsFATW; PTEL05E-F and PTEL05E-R for making pA0-PhtEL05; 
EL05BGL-F and EL05BGL-R for making EL05 expression cassette and the subsequent gene stacking cascade; 
Co-F,Co-R,5'AOXl and 3'AOXl in combination with others for PGR and sequencing confirmation of constructs. 



2.2. Properties of Putative Elongase PhtELOS 

The translated peptide of PhtELOS was analyzed by protein family (Pfam) search, Clustal W and 
TMHMM programs. The results indicated that elongase PhtELOS is an ELO-family protein containing 
seven transmembrane regions and two characteristic motifs FLHXYHH and MYSYY. The motif 
MYSYY is common in all A5-elongases analyzed indicating that it may be unique for EL05 elongases 
(Figure 1). These properties suggested that putative gene PhtELOS is likely to encode fatty acid 
A5-elongase. Phylogenetic analysis indicated that A5-, A6- and A9-elongases included were generally 
grouped into respective EL05-, EL06- and EL09 clusters in accordance with their functions. 
However, there are some exceptions, for example, Marchantia polymorpha elongase MapELOS and 
Thrasutcohytrium elongase ELOl were grouped into EL06 cluster (Figure 2). Among microalgal 
A5-elongases, PhtELOS was strongly clustered with T. pseudonana A5-elongase (ThpEL05) forming 
a tight clade but quite different from other EL05s. Protein sequence distances analyzed using 
DNA-Star (Lasergene) indicated that PhtELOS shares the highest identity of 51% with ThpELOS and 
low identities of 32.6%-38.2% with other members within the EL05 group. Interestingly, although 
Thrasutcohytrium elongase TsELOl shares the lowest identity of 26.1% with PhtELOS, it reportedly 
possessed the activity of ELOS. This may be explained by the existence of motif MYSYY in the 
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TsELOl protein sequence. Together, it is likely that PhtELOS encodes A5-eIongase and its evolution 
history is largely different from those of other microalgae andprotists [20,22,23]. 

Figure 1. Properties analysis of PhtEL05. (A) Prediction of seven transmembrane heUces 
in PhtELOS by onhne analysis of TMHMM program (Trans-Membrane prediction using 
Hidden Markov Models). The predicted regions of transmembrane helices are shown in 
red, other regions are predicted to be either inside (in blue) or outside (in pink) the 
membrane. (B) Multiple peptide sequence alignment (partly shown) was performed using 
the Clustal W and ESPript 3, highlighting the typical motifs of A5-elongase FLHXYHH 
and MYSYY which are underlined and boxed. Protein sequences used are the same as that 
in Figure 2. 
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Figure 2, Phylogenetic dendrogram of fatty acid elongase families. The evolutionary 
relationship was inferred using the Neighbor- Joining method. The percentage of replicate 
trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) 
are shown above the branches. Filled circles at nodes indicate phylogenetic branches that 
were also recovered by using maximum-parsimony algorithms. GenBank accession 
numbers of the sequences are ABR67690 (PavELOS; Pavlova viridis), AAV33630 
(PasELOS; Pavlova sp. CCMP459), ACR53360 (PycELOS; Pyramimonas cordata), 
BAE71129 (MapELOS; Marchantia polymorpha), AAV67798 (OstELOS; Ostreococcus 
tauri), AAV67800 (ThpELOS; Thalassiosira pseudonana), AAT85662 (MapEL06; 
Marchantia polymorpha), AAW70157 (PtEL06; Phaeodactylum tricornutum), 
XP_003074750 (OstEL06; Ostreococcus tauri), AAV67799 (ThpEL06; Thalassiosira 
pseudonana), ACK99719 (PaiEL06; Parietochloris incisa), XP_001780388 (PhpEL06, 
Physcomitrella patens), AEA07666 (C00ELO6; Conidiobolus obscurus), ACR53359 
(PycEL06; Pyramimonas cordata), ADE06662 (MoaEL06; Mortierella alpina), 
ADN94475 (PapEL09; Pavlova pinguis), ADN94476 (ResEL09; Rebecca salina), 
AAL37626 (IsgEL09; Isochrysis galbana). TsELOl (Thrasutcohytrium sp. ATCC26185, 
ref 22); PtELOS (Phaeodactylum tricornutum, arrow shows this study). 
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2.3. Functional Analysis in Yeast: Confirmation of PhtELOS's Activity as A5-Elongase 

To verify whether the putative PhtELOS has A5-elongase activity or not, we heterologously 
expressed this gene in P. pastoris. EL05 expression vector was constructed by using plasmid 
pHBM906 which allows EL05 ORE to be driven by the inducible AOXl promoter [26]. Transgenic 
yeast cells expressing the PhtELOS ORE were fed with precursor fatty acids (A5-) C20 PUFAs 
(referring to ARA and EPA [9,12]) and then were analyzed for fatty acid profiling. Clearly, control 
strain with empty vector pHBM906 saw no new fatty acids formed, whereas the EL05 -expressing 
cells fed with ARA or EPA were shown to have a detectable amount of DTA or DPA accordingly 
(Figures 3 and 4, and Table 2). This result indicated that upon addition of (A5-) C20 PUFA substrate 
new fatty acids of DTA and DPA were formed as immediate products. However, it appeared that 
PhtEL05 displayed a higher rate of converting EPA into DPA(n-3) (93.1%) than that of converting 
ARA into DTA (79.4%). The difference in efficiency was confirmed by using another expression 
vector pA0815, which exhibited the rate of 90.8% and 83.2%, respectively (Table 3). It is likely that 
the difference in conversion efficiency may reflect PhtELOS's substrate preference. 

Figure 3. Comparison of fatty acid profiles of the control and PhtELOS-expressing Pichia 
cells. The representative control strain PHCOl (transformed with empty vector pHBM906) 
and representative EL05 -expressing strain PHE5.01 were grown for 3 days with or without 
adding substrate and subjected to FA analysis. Samples of 100 [iM of (A5-) C20 PUFAs 
were exogenously fed as substrates. (A): control strain PHCOl; (B): PhtELOS-expressing 
strain PHE5.01. Fatty acid profiles of immediate product from substrate were clearly 
observed in gene expressed cells but not in controls. Stars indicate substrates which were 
initially fed and detected as left-over in samples. 
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Figure 4, Comparison of fatty acid profiles (partly shown) of the control and co-expressed 
Pichia ceUs fed with substrates. Control strain PACOl (transformed with empty vector 
pA0815) and co-expressed strain PDEOl (transformed with pAO-D4E5) were fed with 
100 )j.M of (A5-) C20 PUFAs and grown for 3 days, followed by FA analysis. Shown are 
fatty acid GC profiles of indicated strains fed with ARA (A), EPA (B) and ARA & EPA (C). 
Fatty acid profiles of immediate product from substrate were clearly observed in 
co-expressed cells but not in controls. Stars indicate substrates which were initially fed and 
detected as left-over in samples. 
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Table 2. Fatty acid composition of transgenic P. pastoris GSl 15. 



Fatty Acid 








P. pastoris with Plasmids 








Composition 
(% of Total Fatty 
Acids) 


PACOl(E) 


PDEOl(E) 


PACOl(A) 


PDEOl (A) 


PACOl(EA) 


PDEOl (EA) 


PACOl(-) 


PDEOl(-) 


C14:0 


9.75 ±0.13 


15.56 ±0.23 


9.77 ±0.11 


16.2 ± 0.09 


10.12 ±0.24 


17.12 ±0.23 


8.98 ±0.19 


13.79 ±0.24 


C16:0 


3.01 ±0.13 


10.08 ± 0.32 


2.98 ±0.15 


9.90 ±0.14 


3.11 ±0.18 


9.29 ± 0.27 


3.48 ± 0.12 


9.25 ± 0.21 


C17:0 


1.01 ± 0.07 


1.24 ±0.04 


1.23 ± 0.05 


1.13±0.15 


1.31 ±0.13 


1.47 ± 0.09 


1.36 ± 0.09 


1.09 ± 0.04 


C17:l 


2.90 ±0.18 


1.13±0.15 


3.01 ±0.17 


1.13±0.15 


3.32 ±0.14 


1.59±0.11 


2.50 ±0.10 


0.81 ± 0.08 


C18:0 


6.89 ±0.33 


6.03 ± 0.39 


7.24 ± 0.32 


7.44 ± 0.22 


7.54 ±0.32 


6.69 ± 0.23 


6.80 ± 0.22 


6.43 ±0.10 


C18:l «-9 


2.87 ±0.14 


2.31 ±0.18 


2.79 ±0.12 


2.00 ±0.11 


3.07 ±0.16 


2.90 ±0.19 


2.24 ±0.16 


1.77 ±0.07 


C18:l n-1 


42.86 ±2.68 


34.04 ±2.01 


39.13 ±2.05 


32.95 ± 2.92 


38.55 ± 1.63 


29.10 ± 1.92 


38.14 ±2.398 


33.46 ± 1.73 


C18:2 «-6 


19.09 ± 0.75 


21.31 ±0.67 


18.67 ±0.33 


20.28 ± 0.64 


17.27 ±0.47 


17.32 ±0.69 


28.63 ± 1.18 


27.22 ± 0.73 


C18:3 «-3 


4.62 ± 0.13 


4.39 ± 0.19 


4.68 ± 0.19 


4.07 ±0.13 


4.16 ±0.23 


4.14 ± 0.16 


6.14 ± 0.43 


4.69 ± 0.08 


C20:0 


1.79 ±0.19 


1.32 ± 0.23 


1.26 ±0.23 


1.67 ±0.12 


1.74 ±0.14 


1.66 ± 0.11 


1.72 ±0.08 


1.51 ± 0.05 


C20:4 «-6 (ARA) 


ND 


ND 


7.82 ± 0.23 


0.54 ±0.04 


3.32 ±0.17 


1.56 ±0.05 


ND 


ND 


C20:5 «-3 (EPA) 


5.27 ±0.02 


0.26 ± 0.02 


ND 


ND 


5.31 ±0.02 


0.30 ± 0.02 


ND 


ND 


C22:4 «-6 (DTA) 


ND 


ND 


ND 


0.24 ± 0.01 


ND 


0.30 ± 0.02 


ND 


ND 


C22:5 «-6 (DPA) 


ND 


ND 


ND 


2.44 ± 0.03 


ND 


3.00 ±0.07 


ND 


ND 


C22:5 «-3 (DPA) 


ND 


0.24 ± 0.03 


ND 


ND 


ND 


0.34 ±0.01 


ND 


ND 


C22:6 «-3 (DHA) 


ND 


2.35 ±0.05 


ND 


ND 


ND 


1.82 ±0.03 


ND 


ND 



Note: (1) Cells transformed with empty vector pA0815 (representative strain PACOl) or recombinant vector pAO-D4E5 (representative strain PDEOl) grown on different substrate 
containing media were tested. (2) PACOl(E): Strain PACOl with EPA; PDEOl(E): Strain PDEOl with EPA; PAC01(A): Strain PACOl with ARA; PDE01(A): Strain PDEOl with ARA; 
PAC01(EA): Strain PACOlwith EPA and ARA; PDE01(EA): Strain PDEOl with EPA and ARA; PACOl(-): Strain PACOl without adding substrate; PDEOl(-): Strain PDEOl without 
adding substrate. ND: not detected or not detectable. 
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Conversation Rate (%) 



Addition of Single Substrate Addition of Double Substrates 



EPA^DPA(«-3) 
DPA(«-3)^DHA 

ARA^DTA 
DTA^DPA(n-6) 



90.8 
90.8 
83.2 
90.9 



87.9 
84.4 
67.9 
90.9 



Based on domain analysis of PhtELOS, we carried out domain deletion experiments to verify the 
flinction of key ELO-conserved regions. Yeast cells expressing the truncated EL05 deleted for 4VR 
region (4-vinyl reductase, EL05-A1) or EL05 deleted for 4VR region plus part of the first 
transmembrane helix (EL05-A2) were shown to have no alteration in fatty acids profiling (Figure 5). 
The results indicated that at least in transgenic yeast, deletion of A-terminal 4VR domain and partial 
transmembrane regions had little influence on the activity of A5-elongase. This is in support of the 
prediction that PhtEL05's core activity regions range from amino acid 90-328. Together, it is 
concluded that PhtELOS indeed possesses the activity of A5-elongase as expected. 



Figure 5. Fatty acid profiling patterns of transgenic Pichia cells expressing various 
versions of PhtEL05. Yeast cells transformed with empty vector pHBM906, and 
expression vectors containing EL05-full length, EL05-A1 and EL05-A2 were fed with 
(A5-) C20 PUFAs and analyzed for FA composition. (A) to (D): indicated strains fed with 
EPA; (E) to (H): indicated strains fed with ARA. 
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2.4. Co-expression ofPhtELOS and IsFAD4 Assembled Function of EPA and DHA Biosynthetic 
Pathway in Transgenic Yeast 

Having confirmed the specificity of PhtELOS in yeast by single gene expression, we wanted to 
know further what its performance would be in an assembled EPA and DHA biosjoithetic pathway. 
We thus co-expressed microalgal A5-elongase and A4-desaturase in yeast to see if they were functional 
in a non-native system. The co-expression vector was successfully constructed using the stable vector 
pA0815 [26], which allowed PhtELOS and IsFAD4 cassettes to be stacked in a cascade through linker 
sites of BgUl and BamHl. The resultant target construct pAO-D4E5 contained EL05 and FAD4 as a 
cascade driven by strong inducible promoter pAOXl (Supplementary Information, Figure SI). 

Yeast can only synthesize the shortest PUFAs like LA and ALA, it needs the addition of 
(A5-) C20-PUFAS to continue the DHA biosynthesis via microbial A4-pathway [9]. Control strain 
PACOl and EL05-FAD4 co-expressed strain (PDEOl) were exogenously fed with ARA and/or EPA. 
Fatty acid analysis indicated that samples of control cells were unable to detect any new fatty acids 
formed. In contrast, samples of strain PDEOI expressing EL05 and FAD4 were new fatty acids 
detected upon the addition of (A5-) Cio-PUFAs (Figure 4 and Table 2). For instance, fed with ARA 
alone, strain PDEOl was shown to produce DTA and n-6 DP A in comparison with control samples 
where no new fatty acids identified; this indicated that co-expression indeed led to the conversion of 
single substrate ARA into intermediate product DTA and the latter subsequently into product n-6 DPA 
(Figure 4A). Fed with single substrate EPA, n-3 DPA and DHA were identified as new constituents 
besides the left-over EPA in FAME samples of strain PDEOl. In this case, product n-3 DPA was 
formed fi-om EPA and subsequently transformed into DHA (Figure 4B). Consistently, fed with ARA 
and EPA as dual substrates, four above mentioned products and two left-over substrates were all 
detected in PDEOl FAME samples. Taken together, this suggested that the co-expression successfully 
reconstituted the function of both A5 elongation and A4 desaturation orientating the n-3 pathway. 

2.5. PhtELOS 's Converting Efficiencies Varied with Substrates 

To test the effect of substrate competition, cells of double gene co-expressed strain PDEOl were fed 
with the same molar concentration of ARA and EPA concurrently. FA compositions and converting 
rates were determined to relatively estimate the enzymatic activity or preference upon type of (A5-) 
C20-PUFA substrates. As revealed in Tables 2 and 3, in the presence of single substrate, the rates of 
conversion EPA^DPA {n-3), DPA {n-3) ^DHA, ARA^DTA and DTA^DPA {n-6) were 90.8%, 
90,8%, 83.2%) and 90.9%), respectively. However in the presence of dual substrates, the rate of 
conversion (ARA^DTA) dropped drastically from 83.2% to 67.9%, while conversion rates of other 
substrates showed little change, ranging from 84.4% to 90.9%. The relatively lower conversion rate 
upon ARA was also observed in EL05 single gene expression in strain PHD5.01 (79.4% versus 
93.1%)). Obviously, compared to EPA, fewer molecules of ARA were incorporated with elongase 
PhtELOS. It is interpreted that PhtELOS is likely to have preference for EPA among (A5-) C20-PUFA 
substrates. This finding is consistent with the conclusion that A5-elongase is one of the rate-limiting 
enzymes involved in LC-PUFA biosynthesis [2,25]. 
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3. Discussion 

Diatoms are successful groups of unicellular eukaryotic algae playing important roles in global 
carbon and silica pools. P. tricomutum is one of the most widely utilized model systems for studying 
the ecology, physiology, and molecular biology of diatoms. Although the genome of this model diatom 
has been sequenced, the current knowledge on fatty acid metabolic organization and biochemistry 
remains fragmentary [24,25]. Since the availability of the genome, the PUFA sjmthesis associated 
enzymes such as thioesterases, elongases, desaturases, acyl-CoA synthetases and acyltransferases 
have been increasingly explored [19,23,25]. However, P. tricornutum' s fatty acid A5-eIongase 
has not been investigated. We therefore conducted the identification and function analysis of 
P. tricornutum A5-eIongase. 

We present herein three lines of evidence verifying that PhtEL05 functions as fatty acid 
A5-elongase. First, sequence analysis indicated that EL05 is an ELO-family protein, having two 
typical motifs of FLHXYHH and MYSYY. The motif MYSYY is the typical characteristic for all 
microalgal A5 -elongases reported; and it may be crucially required for its enzymatic activity. It is 
supported by the properties of Thrasutcohytrium TsELOl. Although the protein sequence of TsELOl 
is largely distant fi-om most A5-elongases, it possesses quite high activity of C20-A5 elongase [22]. 
This is likely due to the existence of motif MYSYY. Second, results of PhtEL05 single and double 
gene expression in yeast not only verified that PhtEL05 indeed possesses the activity of A5-elongase, 
but also demonstrated that gene stacking of microalgal A5-elongase and A4-desaturase can reconstitute 
the function of the EPA and DHA biosynthetic pathway in the yeast host. Upon addition of 
(A5-) C20-PUFA substrates, both EL05 single- and double-expression strains were able to form new 
fatty acids of C22 PUFAs, strongly suggesting that PhtELOS was a key player in orienting DHA 
synthesis viaA4-pathway [9]. Third, PhtEL05 was shown to have higher conversion efficiencies upon 
EPA than ARA, implying its preference for «-3 C20-PUFA. This observation is quite different from 
EL05 of Pavlova and Thalassiosira which appeared to have little differences on (A5-) C20-PUFA 
substrates [20,21]. To check if PhtEL05 would have activity on saturated C20 fatty acid, we did the 
C20:0 feeding experiment and the results demonstrated that this saturated fatty acid is not the substrate 
of PhtEL05 (Supplementary Information, Figure S2). 

In this study a high efficiency of converting (A5-) C20-PUFA substrates was repeatedly observed in 
EL05 single and EL0-FAD4 double expressing cells. This is largely different from EL05s of other 
microalgae where the conversion rates were generally less than 50% [12,20]. The high conversion rate 
may refiect the high enzymatic activity of PhtEL05 and IsFAD4. Initially we wanted to co-express 
Phaeodactylum A4-desaturase and A5-elongase but failed to get A4-desaturase functional; thus we then 
made use of previously described /. sphaerica A4-desaturase which had been proven to be efficient in 
converting substrate DPA [18]. Furthermore, PhtELOS and IsFAD4 were constructed under a strong 
inducible promoter which may have facilitated the functional stacking of both enzymes in Pichia. 
Gene stacking through a combined construct was proven to have advantages such as fewer selective 
markers and higher expression level [27,28]. However, such very high conversion efficiencies were 
probably attributed to the continuous strong inducing condition we applied: cells were batch-fed with 
methanol every 24 h. Expression patterns of target genes may support the explanation. By using 
Quantitative real-time PGR, mRNA levels of EL05 and FAD4 were shown to be steadily high over the 
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period of cultivation, indicating that the target genes were stably expressed at a high level under 
current inductions (Supplementary Information, Figure S3). All these factors have contributed to 
maintaining the high activities of A5-elongase and A4-desaturase in the Pichia host. 

In nature, yeast including P. postoris cannot synthesize PUFAs longer than LA and ALA [2,9]. 
Introduction of microalgal EL05 and FAD4 into yeast cells can reconstruct the flinctional routes 
orienting EPA and DHA biosynthesis, as part of the microbial A4-pathway (Figure 6). Based on our 
finding, it is concluded that synthesis of C22 PUFAs can be realized via functional assembly of 
EL05/FAD4 in the presence of (A5-) C20 PUFA substrates. EL05 is likely to have a higher converting 
activity upon n-3 C20 PUFA substrate, whereas FAD4 activity remains comparable in both n-3 and n-6 
pathways (Table 3, Figure 6). It is postulated that in transgenic yeast, Phaeodactylum EL05 may 
determine the n-3 pathway as a dominant synthetic route for LC-PUFA production. 

Figure 6. Schematic illustration of the heterologous assembly of microalgal EL05 and 
FAD4 implementing EPA and DHA syntheses in transgenic yeast. 



LA(18:2 A9, 12) 



X 



ALA(I8:3 A9,12,15) 



Addition of (A5-) C20 PUFAsubstrate 



X 



20:4A5,8,11,14 
Arachidonic acid (ARA) 



Lower conversion rate 



^PhtEloS^^ 



20:5A5,8,11,14, 17 
Eicosapentaenoic acid (EPA) 

\ Higlier conversion rate 



22:4A7,10,13,16 
Docosatetraenoic acid (DTA) 

IsFad4 

22:5A4,7,10,13,16 ^ 
Docosapentaenoic acid (DPAn-6) 



\ Comparable rate 




^v:^ 22:5A7, 10,13,16, 19 

Docosapentaenoic acid (DPAn-3) 



]/ Comparable conversion rate 

22:6A4,7,10,13,16,19 
Docosahexaenoic acid (DHA) 



4, Experimental Section 

4.1. Strains and Culture Conditions 



The diatom P. tricornutum Pt9 (CCMP633) was obtained from the Institute of Hydrobiology, 
Chinese Academy of Sciences (Wuhan, China). P. tricornutum was grown in f/2-enriched artificial sea 
water (f/2AW, pH8.5) medium at 25 °C with continuous aeration and illumination (50 mol 
photons m~^s~', by cool daylight fluorescent tubes). P. pastoris strain GS115 was grown and 
maintained in YPD medium. 
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4.2. Sequence Analysis 

DNA or protein sequence alignment and neighbor-joining tree construction were done with Clustal 
W and MEGA 5.2.2. Sequence distances were analyzed by DNA-star. Structural domain prediction 
was carried out using the online server program TMHMM and ESPript 3. 

4.3. Identification and Isolation of the A5-Elongase cDNA from P. tricomutum 

DNA, RNA and cDNA preparation were performed by standard protocols. Molecular tool enzjmies 
and reagent kits were used according to instructions provided by the manufacturers. At the time of 
study there was only a partial sequence of P. tricomutumt mRNA available (NCBI Reference 
Sequence: XM_002 176650.1). This partial start and partial stop fragment was linked to a sequence in 
chromosome 1. In order to isolate its frill length cDNA and putative gene, three forward primers 
PhteIo5-Ul, -U2 and -U3 and one reverse primer Phtelo5-Dl (Table 1) were designed for DNA 
amplification against genomic DNA and the first strand cDNA. For this purpose, the High Fidelity 
PCR system (Roche) was employed to minimize the error rate. The specific PCR products were gel 
purified and cloned into pMD18-T vector (TaKaRa, Dalian, China) after the fragments were prepared 
by the A-tailing procedure. Inserts were then fully sequenced with complete coverage. Assembly of the 
sequences allowed the identification of a full length of ORF from cDNA and the coding sequences. It 
was designated PhtELOS and subjected to cloning into yeast expression vectors. 

4.4. Plasmid Construction and Transformation of Yeast Cells 

Plasmids and primers used in this study are featured in Table 1 . PCR fragments were cloned into 
pMD18-T (Takara) and confirmed by sequencing. To characterize the function of EL05, the PhtEL05 
was cloned into pHBM906 and expressed in P. pastoris. Briefiy, PhtELOS was cloned into pMD18-T 
through cloning sites of Notl and Cpol to generate pMD-EL05. The full length of PhtELOS was 
released from pMD-EL05 by Notl and Cpol digestions, and subcloned into Notl and Q^ol-cut 
pHBM906 (P. pastoris expression vector). In the resultant vector pHBM-EL05, EL05 was placed in 
between the promoter and transcription terminator regions of the AOXl gene. Linearized by Sail, 
pHBM-EL05 was infroduced into P. pastoris sfrain GS115 by electroporation (Bio-Rad). 
Concurrently, the empty pHBM906 vector was infroduced into GSl 15 as confrol. For co-expression of 
EL05 and FAD4 in P. pastoris, the expressing vector pA0815 was used to stack the two genes in a 
cascade. First, IsFAD4 and PtEL05 were individually cloned into pA0815 via EcoRl cloning site to 
form vector pAO-FAD4 and pA0-EL05, respectively. Then, pAO-FAD4 was digested with BamHl 
followed by dephosphorylation with calf intestine alkaline phosphatase according to the 
manufacturer's instructions; and PhtEL05 expression cassette (5'AOXl-PhtEL05-TT) was amplified 
from vector pA0-EL05 with primer EL05BGL-F and EL05BGL-R using a high fidelity PCR. 
Finally, after digestion with BgRl, the PhtEL05 cassette was ligated to fiamM-digested and 
dephosphorylated vector pA0-FAD4, resulting in the target vector designated pAO-D4E5. Various 
PCR and sequencing were carried out to confirm the accuracy of sequence and structure of the stacked 
cassettes prior to the fransformation. 
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4.5. Heterologous Expression ofPtELOS and IsFAD4 in P. pastoris 

To verify the function of PhtELOS, the recombinant vectors pHBM-EL05 and/or pAO-D4E5 were 
introduced in Pichia pastoris. The representative strains of three independent positive transformants 
were selected and grown on 50 mL MGY liquid medium (1.34% YNB with amino acids and 
ammonium sulfate, 10~^% biotin and 1% glycerol) on a shaker (220 rpm) at 28 °C. Cultures of the 
stationary phase were harvested and washed with sterile deionized water twice and inoculated (at final 
rate of OD600 = 1) into the 500 mL glass Erlenmeyer flasks containing 50 mL MM medium (1.34% 
YNB with amino acids and ammonium sulfate, 10~^% biotin and 0.5% methanol). Methanol, 0.5% 
(v/v), was supplemented every 24 h to keep inducing gene expression. All yeast cultures were grown 
for 72 or 96 h at 20 °C and used for fatty acid analysis. 

Heterologous gene expressions were estimated by Quantitative real-time PCR. Total RNA was 
isolated from cells using the TRIzol reagent (Invitrogen China Limited, Beijing, China). First-strand 
cDNA was synthesized using a PrimeScript RT reagent kit with gDNA eraser (TaKaRa Biotechnology 
(Dalian), Dalian, China). qRT-PCR was performed using iCycler iQ5 real-time PCR system (Bio-Rad, 
Hercules, CA, USA) and SYBR Premix Ex Taq II (Tli RNaseH plus kit) (TaKaRa Biotechnology 
(Dalian), Dalian, China), according to the manufacturer's protocols. Full-length PhtELOS or IgFAD4 
cDNA was amplified with primer pairs listed in Table 1. The yeast actin gene ACTl was used as an 
internal standard. The relative mRNA level of the target gene was normalized against that of standard. 

4.6. PUFA Substrate Feeding 

P. pastoris transgenic strains carrying various expressing vectors were grown in BMGY medium 
individually. Each overnight culture was diluted to an OD600 of 0.5 and refreshed by growing for 
further hours till the OD600 value reached approximately 1.0. The cultures were harvested by 
centrifugation and resuspended in the same volume of fresh BMGY medium containing 1% NP40, 
0.5%) methanol and exogenously fed with 100 |a,M each of (A5-) C20FUFAS. The cultures were grown 
for 72-96 h with addition of 0.5%) methanol every 24 h. Cell were harvested, washed once with three 
volumes of 0.5%) Triton X-100, and once with three volumes of distilled water. The pellets were 
subjected to analysis of fatty acid composition (% of total fatty acids). 

4.7. Fatty Acid Analysis 

Microalgae or yeast cells were dried by the Vacuum Freeze-drying System. Fatty acid methyl esters 
(FAMEs) were extracted with petroleum ether and trans-methylated with 0.4 M NaOH in 
methanol [29]. All samples were analyzed using a 7890A gas chromatography (Agilent technologies, 
Santa Clara, CA, USA) equipped with a flame ionization detector (FID) and an HP-FFAP capillary 
column (30m x 250 \im x 0.25 |xm). High purity nitrogen was used as carrier gas. Standards of fatty 
acid mixtures were purchased from Sigma-Aldrich (St. Louis, MO, USA), FAMEs were identified by 
comparison of retention times with those of the authentic standards. The relative amount of FAME was 
quantified by comparing each peak area with that of the internal standard. 1 |xg/|xL of behenic acid 
(22:0) was added into samples as internal standard. The conversion efficiency was calculated as 
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percentage of the relative amount of product divided by the sum of product and substrate (which is 
left-over), namely conversion rate (%) = [product]/([substrate] + [product]) x 100% [18,30]. 

5. Conclusions 

We herein report the characterization and flinctional analysis of P. tricornutum A5-elongase gene 
PhtEL05. Heterologous expression in Pichia confirmed that EL05 possesses strong activity of 
A5-elongase capable of elongating (A5-) C20 PUFA substrates. Substrate competition revealed that 
PhtEL05 had a higher converting activity towards 20:5n-3 than 20:4n-6 fatty acids. Functional 
stacking of Phaeodactylum EL05 and Isochrysis FAD4 in Pichia reconstituted a high-efficiency 
biosynthetic pathway leading to transgenic production of DHA. 
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